RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
In this paper, we establish an upper bound on the return gap between the oracle expert policy and an optimal decision tree policy. This enables us to recast the DT extraction problem as a novel non-Euclidean clustering problem over each agent's local observation and action-value space, with action values as cluster labels and the upper bound on the return gap as the clustering loss.
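The clustering view described above can be illustrated with a toy sketch. It is not the paper's algorithm: the abstract only says the metric is non-Euclidean, so the use of cosine distance here is an assumption, and the names `assign_clusters`, `clustering_loss`, `q_values`, and `centers` are hypothetical. Each observation is represented by its action-value vector, assigned to the nearest cluster center, and each cluster is labeled with the greedy action of its center.

```python
import numpy as np

def assign_clusters(q_values, centers):
    """Assign each action-value vector to the center with the smallest
    cosine distance (ASSUMPTION: cosine distance stands in for the
    non-Euclidean metric; the abstract does not name one)."""
    qn = q_values / np.linalg.norm(q_values, axis=1, keepdims=True)
    cn = centers / np.linalg.norm(centers, axis=1, keepdims=True)
    # Largest cosine similarity is the same as smallest cosine distance.
    return np.argmax(qn @ cn.T, axis=1)

def clustering_loss(q_values, centers, assignment):
    """Mean cosine distance between each vector and its assigned center."""
    assigned = centers[assignment]
    cos = np.sum(q_values * assigned, axis=1) / (
        np.linalg.norm(q_values, axis=1) * np.linalg.norm(assigned, axis=1)
    )
    return float(np.mean(1.0 - cos))

# Toy data (made up for illustration): 4 observations, 2 actions.
q_values = np.array([[1.0, 0.1], [2.0, 0.3], [0.2, 1.5], [0.1, 0.9]])
centers = np.array([[1.0, 0.0], [0.0, 1.0]])

assignment = assign_clusters(q_values, centers)
# Each cluster (a would-be leaf) takes the greedy action of its center.
leaf_actions = np.argmax(centers, axis=1)
loss = clustering_loss(q_values, centers, assignment)
```

In this sketch, vectors that differ only in scale fall into the same cluster, which is one reason a direction-based distance may suit action values better than a Euclidean one.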
Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California > Monterey County > Monterey (0.04)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- Europe > Finland > Northern Savo > Kuopio (0.04)
Country:
- Europe > Netherlands > South Holland > Leiden (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Technology:
- Information Technology > Data Science > Data Mining (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Country:
- North America > Canada > Quebec > Montreal (0.04)
- Asia > Middle East > Jordan (0.04)
Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Country:
- Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
- Europe > Finland (0.04)
- North America > United States (0.04)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Country:
- North America > United States (0.04)
- North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
- North America > Canada > Ontario > Toronto (0.04)
Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Country:
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)
Country:
- North America > United States (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > France (0.04)
Industry:
- Law (0.46)
- Information Technology > Security & Privacy (0.46)